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Introduction 

We derive sharp bounds on the rate of relaxation to equilibrium for two models of random 
collisions connected with the Boltzmann equation, as well as several other stochastic evolutions of 
a related type. In fact, there is a fairly broad class of models to which the methods used here may be 
applied. The starting point is a model due to Mark Kac [10] of random energy preserving "molecular 
collisions" , and its analysis provides the pattern for the analysis of all of the models discussed here, 
including a more physically realistic model of random energy and momentum conserving collisions. 
However, since the features of the Kac model have motivated the method of analysis presented 
here, we begin by introducing it. 

The Kac model represents a system of N particles in one dimension evolving under a random 
collision mechanism. It is assumed that the spatial distribution of the particles is uniform, so that 
the state of the system is given by specifying the N velocities vi, t>2, • • • , v n- The random collision 
mechanism under which the state evolves is that at random times Tj , a "pair collision" takes place 
in such a way that the total energy 

N 

E = E v l ( L1 ) 

k=i 

is conserved. Since only a pair of one dimensional velocities is involved in each collision, there 
are just two degrees of freedom active, and if the collisions were to conserve both energy and 
momentum, the only possible non-trivial result of a collision would be an exchange of the two 
velocities. Since Kac sought a model in which the distribution of the velocities would equilibriate 
over the energy surface specified by (1.1), he dropped the requirement of momentum conservation, 
and retained only energy conservation. 

With energy conservation being the only constraint on a pair collision, the kinematically possible 
"post-collisional" velocities when particles i and j collide, v* and v * , are of the form 

v*(0) = Vi cos(0) + vj sin(0) and v*(6) = -Vi sin(0) + vj cos(0) (1.2) 

where, of course, Vi and Vj are the pre-collisional velocities, and £ (—tt,tt]. 

To specify the evolution, consider it first in discrete time, collision by collision. Let 

v(k) = (v 1 (k),v 2 (k),...,v N (k)) (1.3) 

denote the state of the system just after the kth collision. Evidently, v(k) is a random variable 
with values in S N ~ 1 (s/E), the sphere in R N of radius >/~E, where E is the energy. Let (j) be any 
continuous function on S N ~ 1 (y/E). We will specify the collision mechanism by giving a formula 
for computing the conditional expectation of 4>(v(k + 1)) given v(k), which defines the one-step 
Markov transition operator Q through 

Qcj)(v) = E{(f>(v{k + 1)) | v(k) = v } . (1.4) 

In the collision process to be modeled, the pair {i, j}, i < j, of molecules that collide is to be 
selected uniformly at random. Then the velocities Vi and Vj are updated by choosing an angle 
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9, and letting (1.2) define the post-collisional velocities. Let p(0) be a probability density on the 
circle; i.e, 



/: 



p(e)de = 1 , (i.5) 



and take p to be the probability density for the outcome that the collision results in post-collisional 
velocities v*(9) and v*{9) as in (1.2). 

Then for any function <p on IR n , and hence by restriction on S N ~ 1 (s/E), 

E{J>(v(k)) I v(k-l)=v}= Y, f P(0)f(v u v 2 ,...,v*(9),...,v*(9),...,v n )d9 , 

and thus 

Q^{v)=( N \ ^ p(0)Hvi,V2,---,v*(9),...,v*(9),...,v n )d9 . (1.6) 

This expression can be clarified if for each i < j we let Rij(9) denote the rotation in IR N that 
induces a clockwise rotation in the Vi,vj plane through an angle 9, and fixes the orthogonal com- 
plement of this plane. Then Rij(9)v is the post-collisional velocity vector corresponding to the 
pre-collisional velocity vector v, and (1.6) can be rewritten as 



-1 N 

/ \ \ 

Q<t>{v 



(*) r p(9)<t>(Ri,j(9)v)d9 . (1.7) 

v / i<j J ~ n 

Let T~Ln,e denote the Hilbert space of square integrable functions / on the sphere S N ~ l (\fE) 
equipped with the normalized uniform measure d//jv. Let (•, •) and || • || denote the inner product 
and norm on TLn,e- We now require that p(9) = p{—9) so that Q is self adjoint on TLn,e- We also 
require that p be continuous and strictly positive at 9 = 0, which is a convenient condition ensuring 
that Q is ergodic. 

It is clear from (1.7) that Q is an average over isometries on Hn,e, and hence is a contraction. 
Moreover, ||Q/||2 = II /lb if and only if / is constant by our ergodicity assumptions on p. 

Because Q is self adjoint, it updates the probability density ff. for v as well. Indeed, for any test 
function 4>, 

[ <f>(v)f k+1 (v)dp N = E^vik + 1)) 

JS N - 1 (-/E) 

= E {E{4>{v{k + 1) | v(k) }) = EQ<f>(y(k)) 
= _ 4>(v)Qfk(v)dp N 

JS N - 1 {-/E) 

which of course means that Qfk = fk+i- 

One passes to a continuous time description by letting the waiting times between collisions 
become continuously distributed random variables. To obtain a Markov process, the distribution 



3 



of these waiting times must be memoryless, and hence exponential. Therefore, fix some parameter 
tn > 0, and define the Markovian semigroup Gt, t > 0, by 



oo 



G t f = e-(*/™> V { ^j^Q k f = eW™™- 1 ) f , 

K. 

k=0 

which gives the evolution of the probability density for v, continuously in the time t. 

It remains to specify the dependence of tn on N. Let denote the waiting time between 

collisions in the iV-particle model. Suppose that the waiting time for any given particle to undergo 
a collision is independent of N, which corresponds roughly to adjusting the size of the container 
with N so that the particle density remains constant. Suppose also that these waiting times are all 
independent of one another, which should be reasonable for a gas of many particles. (See Kac [10] 
for further discussion.) Then we would have Pr\T^ N ^ > t} = Pr{T^ > t} N , or e~ tTN = e~ NT / Tl . 
Changing the time scale, we put t\ = 1 and hence tjv = 1/N. Therefore, the semigroup is given 
by G t = e tN (Q~ T \ For any initial probability density /o, f(v,t) = G t fo(v) solves Kac's Master 
Equation 

d 

-f(v,t) = N(Q-I)f(v,t) , (1.8) 

which is the evolution equation for the model in so far as we are concerned with the probability 
density f(v, t) for the velocities at time t, and not the velocities v(t) themselves, which are random 
variables. 

Because of the ergodicity, if /o is any initial probability density for the process, it is clear that 
limt^oo G t fo = 1. The question is how fast this relaxation to the invariant density 1 occurs. To 
quantify this, define 



2 = 1,(/,1)=0 . (1.9) 



X N = sup |^(/, Qf) 

If this quantity is less than one, there is a "gap" in the spectrum of Q, and hence the spectral gap 
of Q is defined to be 1 — Xn- Clearly then, the spectral gap for N(Q — I) is 

A N = N(l - X N ) . (1.10) 

This quantity is of interest in quantifying the rate of relaxation of Gtfo to 1 since for any square 
integrable initial probability density /o, 

||G t (/ -l)||2<e- tAjv ||/o-l||2 • 

Mark Kac, who introduced this operator and process [10] in 1956, observed that for each fixed 
£, the subspace of spherical harmonics of degree £ in S N ~ 1 (\^E) is an invariant subspace under Q. 
(This is especially clear from (1.7) since if 4> is i n such a subspace, then so is <p o Ri^iO) for each 
pair i < j and each angle 6). Since each of these subspaces is finite dimensional, Q has a pure point 
spectrum. He remarks that it is not even evident that > for all N, much less that there is a 
lower bound independent of TV. (As Diaconis and Saloff-Coste noted in [5], Q is not compact.) He 
nonetheless conjectured that 

liminf Ajv = C > . (1.11) 
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Kac's conjecture in this form, for the special case p = 1/2-k considered explicitly by Kac, was 
recently proved by Janvresse [9] using Yau's martingale method [13], [14]. Her proof gives no 
information on the value of C. One result we prove here is that in the case p = l/2n, 

A N ='-^±1, (1.12) 

27V-1 ' v ' 

and hence 

liminf A N = - . (1.13) 

The result (1.12) has also been obtained by Maslin in unpublished work, using entirely different 
methods. Some account of Maslin's results can be found in a paper [5] by Diaconis and Saloff-Coste 
in which it is shown that A^v > C/N 2 for the Kac model as well as a natural generalization of it in 
which the sphere S N is replaced by the special orthogonal group SO(N). Our method gives exact 
results in this case too, as we shall see. 

Maslin's approach was based on the representation theory of the group SO(N), and does not seem 
to extend to more general cases, such as a non-uniform density p(9), or to momentum conserving 
collisions. According to his former thesis advisor, Persi Diaconis, this is one reason it was never 
published. We will comment further on the relation of our paper to previous work, especially [9], 
[13], [14] and [5], in section 3 where we carry out our analysis of the Kac model, and in section 6 
where we analyze the SO(N) variant of the Kac model. 

Kac did not explicitly conjecture (1.13), only (1.11), though he discussed motivations for his 
conjecture that do suggest (1.13). In particular, he was motivated by a connection between the 
many-particle evolution described by the Master equation (1.8), and a model Boltzmann equation, 
and he did rigorously establish the following connection: For each integer k, 1 < k < N, let ir k be 
the kth coordinate projection on S N ~ 1 (\^E); i.e., 

TT k (v 1 ,V 2 ,. ■ ■ ,V N ) = V k ■ (1.14) 

Then define the orthogonal projections P k through 

(Pkf °-K k ,goTi k ) = (f,gOTT k ) 



for all / in TCn.e, and all continuous bounded functions g on \—y/E,y/E\. That is, P k is the 
orthogonal projection onto the subspace in TCn,e consisting of functions of the form g o ir k . In 
probabilistic language, P k f is the conditional expectation of / given v k ; i.e., 

P k f(v) = E{ / | v k = v } , (1.15) 

and when / is a probability density on S N ~ 1 (\fE), P k f is its kth single particle marginal. Kac 
showed that if a sequence of initial densities /q N ^ on S N ~ 1 (y/E) satisfies a certain symmetry and 
independence property that he called "molecular chaos" , and if furthermore 

g(v) = Jim Pi/ (u) 
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exists in L l {IR), then so does g(v,t) = lini/v^oo -Pi (Gtfo(v)), and g(v,t) satisfies the Kac Equation 








(v,t)=2f ( [ [g(v*(9),t)g(w*(9),t)-g(v,t)g(w,t)]dw) p(6)d6 . (1.16) 



-TT \J]R 



The fact that there is a quadratic non-linearity on the right is due to the fact that the underlying 
many particle dynamics is generated by pair collisions. The factor of 2 on the right hand side 
comes from the 2 in the the normalization factor 2/N(N — 1) in the definition of Q, (1.6). The N 
is absorbed by the factor of N in N(Q — I), the generator of Gt, and the TV — 1 is absorbed by 
summing over all the N — 1 particles with which the first particle can collide. 

Kac's limit theorem provides a direct link between the linear but many particle Master equation 
(1.8) and the one variable but non-linear Kac equation (1.16). Kac's proposal was that one should 
be able to obtain quantitative results about the behavior of the Master equation, and from these, 
deduce quantitative results on the Kac equation (1.16). Specifically, he was concerned with following 
this route to results on the rate of relaxation to equilibrium for solutions of (1.16). 

It is easy to see that for any j3 > 0, 



is a steady state solution of the Kac equation (1.16). (In the kinetic theory context, the Gaussian 
density in (1.17) is known as the Maxwellian density with temperature 1//3). Indeed, as is well 
known, nip is the limit of the single particle marginal on S N ~ l (yJ N / [3) as N tends to infinity. 
Kac wanted to show that for any reasonable initial data g(v), the Kac equation had a solution 
g(v,t) with lim^oo g(v, t) = mp{v) where J M v 2 g(v)dv = 1/(3. Indeed, he wanted to show that 
this convergence took place exponentially fast, and he boldly conjectured that one could prove this 
exponential convergence for the Master equation from whence (1.16) came. At the time Kac wrote 
his paper, very little was known about the non-linear Boltzmann equation, Carleman's 1933 paper 
[2] being one of the few mathematical studies. Given the difficulties inherent in dealing directly 
with the non-linear equation, his suggested approach via the Master eqaution was well motivated, 
though unfortunately he did not suceed himself in obtaining quantitative relaxation estimates by 
this route, and other workers choose to directly investigate the non-linear equation. 

Evidence for the conjectured exponential convergence came from linearizing the Kac equation 
about the steady state solutions mp. The resulting generator of the linearized Kac equation can 
be written in terms of averages of Mehler kernels, as shown in [12], and so all of the eigenfunctions 
are Hermite polynomials (as Kac had observed in section 9 of [10]). The eigenvalue corresponding 
to the nth degree Hermite polynomial, n > 1, is then readily worked out to be (see [12]): 



The eigenvalue is zero for n = 2, corresponding to conservation of energy. As we have indicated, 
Kac actually only considered the special case in which p was uniform; i.e., p(6) = 1/2tt. In this 




(1.17) 




(1.18) 
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case, the eigenvalues are —1 for n odd, and are monotonically decreasing toward — 1 for n even. 
Thus, the eigenvalue corresponding to the to fourth degree Hermite polynomial determines the 
spectral gap for the linearization of (1.16) in this case. The fact that this gap is 1/2 is consistent 
with (1.13), and bears out Kac's intuition that there is a close quantitative connection between his 
Master equation (1.8), and the Kac equation (1.16). 

In fact, as we shall see, in the case considered by Kac and some other cases as well, Xn is an 
eigenvalue of Q of multiplicity one, and QfN{v) = Ajv/jv(u) for 

N 

f N ( Vl ,...,v N ) = 52{v$-{l,vp) . (1.19) 

.7 = 1 

If E = N and g^ is defined by Pi(/jv) = 9n ° ^n, then 

lim g N (v) = mi(v)/i( 4 )(«) , (1.20) 

N— >oo 

where /i( 4 ) is the fourth degree Hermite polynomial for the standard unit variance Gaussian measure 
on IR. (This is fairly evident, but will be fully evident in view of the formula for Pi given in section 
2.) Thus, the correspondence between the spectral gaps extends to a correspondence between the 
eigenfunctions too. 

McKean [12] and Gruenbaum [7], [8] have further investigated these issues. In particular, McK- 
ean conjectured that reasonable solutions of (1.16) should relax to the Gaussian stationary solutions 
of the same energy in L 1 at the exponential rate e - */ 2 corresponding to the spectral gap in the 
linearized equation. He proved this for nice initial data but with exponential rate e~ tc where c is 
explicit constant, but about an order of magnitude smaller than 1/2. Later, in [4] this result was 
established with the almost the sharp rate, i.e., e - ^ 1 / 2 - 1- ^* for nice initial data. See the papers 
for precise statements, but note that all of this is in the case p = 1/2tt. (The results are stated 
differently in [12] and [4], which use a different time scale so that the factor of 2 in (1.16) is absent). 

If one expects that the linearized version of (1.16) is a good guide to the behavior solutions of 
(1.16), one might guess that (1.18) provides a good guide to the relaxation properties of solutions 
of (1.18). This would suggest that in the case in which p is uniform, the slowest mode of relaxation 
corresponds to initial data of the form mi(v)(l + eh^(v))) for small e. 

If one further believed that the non-linear Kac equation (1.16) is a good guide to behavior of 
solutions of Kac's Master equation, then one might guess that the slowest mode of relaxation for 
the Master equation is a symmetric fourth degree ploynomial, at least for uniform p. Such a line 
of reasoning suggests (1.19) as a candidate for the slowest mode of relaxation for Kac's Master 
equation. This turn out to be correct, as we have indicated, and this shows how well-constructed 
the Kac model is: A great deal of information is washed out and lost whenever one passes from the 
-/V-particle distribtion function f(v) to its single particle marginal distribution g(v). In general, 
there would be no reason to expect that the slowest mode of decay for the Master equation would 
not be lost in passing to the marginal. 

Indeed, it is easy to see that /n is in fact an eigenfunction of Q. We shall see that for many 
choices of p, f^ is the optimizer in (1.9). This correspondence between Kac's Master equation (1.8) 
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and the linearized version of the Kac equation (1.16) is a full vindication of Kac's conjectures. It 
also shows that his model is free of extraneous detail at the microscopic level; what happens at 
the microscopic level described by the Master equation is what happens at the level described by 
(1.16). 

We conclude the introduction by briefly stating our results for the Kac model itself, and then 
describing the structure of the paper. The key result in our analysis of the Kac model is the 
following theorem which reduces the variational problem (1.9) to a much simpler, purely geometric, 
one dimensional problem: 



Theorem 1.1 For all N > 3, 



where 

UN = sup 



O7Tl,0O 7T 2 



\9° 7Tl||2 



A N > (1 - k jv )A jv _ 1 (1.21) 
</€C([-l,l]) , <0O7n,l> = O 1 (1.22) 



Notice first of all that g is a function of a single variable - in contrast to (1.9), (1.22) is a one- 
dimensional variational problem. Also notice that (1.22) doesn't involve p, or otherwise directly 
refer to Q. 

The bound in Theorem 1.1 implies that liminfAr^oo ^-n > 1X^=3 (1 ~~ K j)A 2 . Since the necessary 
and sufficient condition for the infinite product to be non-zero is that 



5^Kj<oo, (1.23) 

J=3 



proving the Kac conjecture is reduced to the problem of proving the summability of Kj and the 
strict positivity of A 2 . 

The second part is easy, since for two particles, Q is an operator on functions on S 1 . Indeed 

(/, Qf) = ^ f_ f_ fWfty ~ 9)p(9)d9d^ , (1.24) 
and writing this in terms of Fourier series leads to 

A 2 = sup(/ p(6)cos(k9)de\ . (1.25) 

fe#0 U-7T J 

By the Riemann-Lebesgue lemma, A 2 < 1, and hence A 2 = 2(1 — A 2 ) > 0. 

As for the summability of Kjy, note from (1.22) that is a measure of the dependence of the 
coordinate functions on the sphere. With E = N, the marginal distribution of (v 1 ,v 2 ) induced by 
p, N is 

ICJV-3I / ,,2 , 2\ (iV-4)/2 
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As N tends to infinity, this tends to 



_L e -(vlW 2 )/2 dvidv2 

and under this limiting measure, the two coordinate functions v 1 and v 2 are independent. Hence 
for any admissible trial function g in (1.22), 

lim (5071-1,507^) = 



^ / g^g^e-^+^^dv^ = 
2tt Jm 2 

4=/ 9(«i)e-" ?/2 d^ / 5 (t; 2 )e-^/ 2 d W2 = 

V27T JjR V27T Jm 

lim (5 7ri,l)(5 7r 2 ,l) = 

N^oo 

which implies that limAr^^ km = 0, without, however, showing how fast. 
In fact, it is not hard to compute k^: 



Theorem 1.2 For all N > 3, 

K N 



(1.26) 



N 2 - 1 



Since this is summable, (1.23) holds, and so the Kac conjecture is proved. But Theorem 1.2 tell 
us much more than just (1.23). One can exactly solve the recurrence relation in Theorem 1 with 
kn = 3/(N 2 — 1). As we shall see, this leads to: 



Theorem 1.3 For all N > 2, 



A W > 1 -^ W + 2 . (1.27) 
2 N -1 V ; 



Moreover, this result is sharp for the case considered by Kac; i.e., constant density p, in which case 
A2 = 0, and more generally whenever 

/TV PIT 
p{6) cos(fc0)d0 < / p{6) cos(40)d0 (1.28) 
-7T J —TV 

for all k^=0. In all of these cases, \n has multiplicity one, and the corresponding eigenfunction is 

N 

EW" • (1-29) 



fc=i 



Notice that An does not depend on E. This is easy to see directly from the fact that Q commutes 
with the unitary change of scale that relates Hn,e and T~^n,e' for two different values, E and E', of 
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the energy. We have kept it present in the discussion until now on account of the relation between 
the uniform probability measure on S N ~ 1 (s/E), and the unit Gauss measure on the real line, or 
on ZR 2 as in (1.26). Having said what we wish to say about this, it will be simplest to henceforth 
set E = 1, and to delete it from our notation. 

The division of our results on the original Kac model into Theorems 1.1, 1.2 and 1.3 of course 
reflects the steps in the method by which they are obtained. However, it also reflects a point of 
physical relevance, namely that is completely independent of p{6). The complicated details of 
the collision mechanism do not enter into k>n- Rather, they enter our estimate for Ajv only through 
the the value of the two particle gap A2 = 2(1 — A2). Once this is computed, there is a purely 
geometric relation between the values of the gap for different values of N. The fact that there 
should be such a simple and purely geometric relation between the values of the gap for different 
values of N is a very interesting feature of the Kac model which expresses the strong sense in which 
it is a binary collision model. 

The paper is organized as follows: In Section 2 we identify the general features of the Kac 
model that enable us to prove Theorem 1.1. We then introduce the notion of a Kac system, which 
embodies these features, and prove the results that lead to analogs of Theorem 1.1 for general 
Kac systems. This provides a convenient framework for the analysis of a number of models, as we 
illustrate in the next four sections. Section 3 is devoted to the Kac model itself, and contains the 
proofs of Theorems 1.1, 1.2 and 1.3. Section 4 is devoted to the analysis of the master equation for 
physical, three dimensional, momentum and energy conserving Boltzmann collisions. Sections 5 is 
devoted to a shuffling model that has been studied in full detail by Diaconis and Shahshahani [6]. 
We include this here because it can be viewed as the Kac model with momentum conservation, and 
is very simple. (We hasten to add that Diaconis and Shahshahani do much more for this model 
than compute the spectral gap). Then in Section 6 we treat another generalization of the Kac 
model, this time in the direction of greater complexity: The SO(N) model of Maslin, Diaconis and 
Saloff-Coste [5]. Finally, in Section 7 we show that the quartic eigenfunction (1.19) is indeed the 
gap eigenfunction for a wide range of nonuniform densities p that violate (1.28). 

2: General Features 

The Kac model introduced in the previous section has the following general features that are 
shared by all of the models discussed here: 

Feature 1: For each N > 1 there is measure space (Xn, Sn, Pn), with fi^ & probability measure, 
on which there is a measure preserving action ofUpj, the symmetric group on N letters. We denote 

H N = L 2 (X N ,p N ) . (2.1) 

We think of Xjv as the ll N particle phase space" or "iV particle state space", and the action of 
Hn as representing "exchange of particles". In the Kac model, Xn is S N ~ l , Sn is the Borel field, 
and /Ujv is the rotation invariant probability measure on S 1 ^ -1 = Xn- A permutation a € Hn acts 
onljv through 

a(v 1 ,v 2 , ...,v N ) = (tv ( i) , v a{2 ) , ■ ■ -,v a ( N )) ■ 
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Feature 2: There is another measure space (Yn,Tn,vn) and there are measureable maps -Kj 
Xn — ► Yn for j = 1, 2, . . . , N such that for all a 6 IIjv, and each j, 



TT j oa = TT a(j) 



(2.2) 



Moreover, for each j, and all A £ T N , 



We denote 



u N {A) = UNinJ^A)) . 
JCn = L 2 (Y N , v N ) . 



(2.3) 
(2.4) 



We think of iTj(x) as giving the "state of the jth particle when the N particle system is in state 
x" . For example, in the Kac model, we take 



n j (v 1 ,v 2 , ■ ■ -,v N ) = Vj G [-1, 1] 



(2.5) 



and thus we take Yn = [—1, 1]. In this case, Yn does not depend on N, and it may seem strange 
to allow the single particle state space itself to depend on N. However, the methods we use here 
permit this generality, and some of the examples considered here require it. 

Notice that once Yn and the ttj are given, vn is specified through (2.3). In the Kac model we 
therefore have 

jJV-21 



v N (v) 



\S l 



\ S N- 



l(l_ v 2 )( 7V-3)/2 dt; 



(2.6) 



Feature 3: For each N > 3 and each j = 1, 2 . . . , N, there is a map 

<t>i ■ (Xn-i x Y N ) — »■ X N 

so that 



(2.7) 



7T 3 (<P J (x,y))=y (2.8) 
for all j = 1, . . . , N and all (x, y) £ Xn-i x Yn- Moreover, <pj has the property that for all A £ Sn, 

ItiN-i ® v N \ {^{A)) = fi N (A) , (2.9) 

or equivalently, for all bounded measurable functions f on Xn, 



j fdn N = / f{4> 3 {x,y))d.HN-i{x) 



dv N {y) ■ 



(2.10) 



In the Kac model case, for any v £ Xn-i = S N 2 and any v £ Yn = [—1, 1] we put 

4>n(v,v) = (\/T- v 2 v,v) , 
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(2.11) 



and (2.9) is easily verified. 

So far, none of the features we have considered involve the dynamics. That is, the first three 
features are purely kinematical. The fourth feature brings in the Markov transition operator Q. 
We do not make the dependence of Q on N explicit in our notation, since this will always be clear 
from the context. 

Feature 4: For each N > 2, there is a self adjoint and positivity preserving operator Q on TCn 
such that Ql = 1. These operators are related to one another by the following: For each N > 3, 
each j = 1, 2, . . . , N, and each square integrable function f on Xn, 

1 N r 

(f, Qf)n N = E / «£.v> Qkv)n N -,) dMv) (2-12) 
where for each j and each y £ , 

hv(-) = f(4>j(;y)) • (2-i3) 



It is easily verified that the Kac model posseses this feature. 

Definition: A Kac System is a system of probability spaces (Xjy, Sn, [In) and (Yjy, 7jv, vn) for 
iV G IN, N > 2, together with, for each TV, maps TTj and (f)j, j = 1, 2, . . . , N, a measure preserving 
action of Hn on (Xn,Sn, H>n), and a Markov transition operator Q on H = L 2 (Xn, /j,n), related 
to one another in such a way that they possess all of the properties specified in features 1 through 
4 above. 

In analyzing the spectral gaps of the operators Q in Kac systems, certain other operators related 
to conditional expectations will play a central role, as indicated in the previous section. Suppose 
that (Xn, <Sn, [J-n), {Yn,Tn,vn), ftj, and (f>j, are defined and related as specified above. For each 
j = 1, 2, . . . , TV, let Pj be the orthogonal projection onto the subspace of TCn consisting of functions 
of the form g o Wj for some g £ /Cjv- Then, with y = iTj(x) and fj tV given by (2.13), 

p jf( x ) = 9(^j(x)) where g(y) = / / Jiy (5)d/xjv_i(x) . (2.14) 

J Xn-i 

In terms of these projections, define 

1 N 

which is clearly a positive contraction on Htv- Define a contraction K on /Cjv by 

(Kg) o tt n = P N (g o ttjv.!) . (2.16) 
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Note that Kg(y) is the conditional expectation of g o tt 2 given that tt\ = y. That is, 



Kg(y) = E{g o ttat | ttat-i = y } . (2.17) 

In concrete examples, it is easy to deduce an explicit formula for K from (2.16) and (2.14) or 
directly from (2.17). By the permuation symmetry, 

Pi (9 ° Tj) = (Kg) o m for all i^j. (2.18) 

Combining (2.13), (2.14) and (2.16), we obtain 

Kg(y)= 5(7T A r_i((/> A r(x,y)))d^ A r_i(x) (2.19) 

J -XV -i 

which provides an explicit form for the operator K. For example, in the case of the Kac model we 
obtain 

Kg(v)) = / g(V 1 - v^w N -i)dfjt N -i(w) 

= J g(Vl-v 2 w)diy N - 1 (w) (2.20) 

ICJV-3I /•! , 



from (2.6) and (2.19). 

Theorem 2.1 Given any Kac system, let P and K be defined by (2.15) and (2.16). Define [In, 
kjst and (3n by 

»n = sup{ (/, Pf) HN | \\f\\ HN = 1 and (1, f) n „ = } (2.21) 

k n = sup{ (g,Kg) KN \ \\g\\ic N = 1 and (l,g)ic N = } (2.22) 

= F^T |inf{ ^,Kg) KN | \\g\k N = 1}| • (2.23) 
Suppose, moreover, that the operator P has pure point spectrum. Then, either fi^ = or 

MiV = max|l(l + (iV-l) Kj v) , l(l + (jV-l)/3jv) } • (2.24) 

In case > (3n, then the multiplicity of kn as an eigenvalue of K coincides with the multiplicity 
of as an eigenvalue of P, and the map 

( 1 \ 1/2 N 

*" U + (jy - 1)w) ) (*■*) 

zs an isometry from the k n -eigenspace of K in K-n to the n N -eigenspace of P inTCm. 
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Proof: Suppose that / is an eigenfunction of P with eigenvalue v. Since P commutes with 
permutations, we may assume that either / is invariant under permutations, or that there is some 
transposition, which we may as well take to be cti^, such that / o <7 li2 = — /. We will treat these 
two cases separately. 

First suppose that / is symmetric. Then for some h £ K-n independent of k, Pkf = h o i[ k , and 

so 

N N 

uf = Pf= N^ P Kf=N^ h07Tk - (2 ' 26) 

k=i k=i 

where v is the eigenvalue, and h is some function on IR. Applying Pi to both sides of (2.26) yields 

1 N 

» p if = xJ2 p ^ ho7r ^ ' 

fc=i 

which can be easily simplified to 

uh O TTi = — (h O 7Tl + ( N — l)Kh O 7Ti) . 

If h is identically zero, i.e., if Pjf = for all j = 1, . . . , N, the eigenvalue v must necessarily be 
zero by (2.26). Otherwise, h is an eigenfunction of K with eigenvalue v so that 

v=^(\ + (N-\)v) . (2.27) 

Note that v may vanish without h being the zero function. In this case h must be an eigenfunction 
of K with eigenvalue —\/{N — 1). In any case, v = or otherwise equation (2.27) must hold. 
We next consider the case in which 

/ocr 1)2 = -/ . 

Note that 

Pk(f O <7l, 2 ) = P k f = 

whenever /c is different from both 1 and 2. It follows that 

N 



A?" TV 
fe=i 

Again, applying Pi to both sides of the equation and keeping in mind that when P\f = hoir\ 
then P<if = —h o t[ 2 we get 

fh O TTl = — (h O 7Ti — K/l O 7Ti) . 

In case h vanishes identically, the eigenvalue v must vanish also, otherwise h is an eigenfunction of 
K with eigenvalue v such that 

1 -v , 
u = — . (2.28) 
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Since by the definition (2.23), — (TV— 1)(3n is the most negative eigenvalue of K, the determination 
of fi n now follows from (2.27) and (2.28). 

For the last part, observe from (2.27) and (2.28) that when kjv > 0n > 0, 

VN = ^{1 + {N-1)k n ) , (2.29) 

and, by the argument leading to (2.27), any eigenfunction of P with eigenvalue [In is in the range 
of the map in (2.25), and a simple computation verifies the isometry property. ■ 

Theorem 2.2 Given any Kac system, let P and be defined by (2.15) and (2.21). Define Xn by 
Aiv = sup{</, Qf) HN | \\f\\ HN = 1 , (/, 1) Hn = } . (2.30) 

Then 

Ajv < (Ajv-i + (1 - \n-i)hn) ■ (2.31) 

Moreover, there is equality in (2.31) if and only if the suprema in (2.30) and (2.21) are attained 
at a common function fx ■ 

Proof: We start from (2.12), taking any function / G Hn satisfying the conditions imposed in 
(2.30). 



1 N f 

(f,Qf)n N = T^E / (fj,y^Qfj,y)n N -i^N(y) 
.7=1 JYn 



3- 

N 



^E / {[f hy -P ] f(y)]+P 3 f(y),Q{[f 3 ,y-P ] f(y)]+P ] f(y)))n N ^N(y) 
i=i N 

i N r 

= ^E / (\fi,v-Pjtty)]>Q\fi,v-Pjf(y)])H N - 1 dMv) 
N j=1 Jy n 

i N r 

+ ^E/ \Pjf(y)\ 2 dMy) , 
N j=1 Jy n 

Since each Pjf(y) is constant on -Xjv-i and so on TCn-i, QPjf(y) = Pjf{y), an d 

([hy-Pjf(y)],Pjf(y))n N ^=o. 

But 

i N r 

^E/ \Pjf(y)\ 2 dMy) = (f,Pf)n N , 
N j=1 Jy n 
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and hence 



1 N r 

(f,Qf)n N = T7 E / <[/i.v " " P 3 f{y)])n N ^N{y) / ^ 

^ Vy w (2.32) 

+ (f,Pf)n N , 

Now since - Pjf(y)], = for each y and j, 

<[/™ - Pjf(v)],Q\fj,v - Pjf(y)])n N ^ < \N-i\\fj, y - Pjf 

= Aiv-i (11/i.vll^ - 1^/(^)1^-0 • 

Averaging over j and integrating over y, 

T7 £ / " p ,ttv)]>Q\fj,v ~ P 3 f{y)])n N ^N{y) < \n-i (\\f\\ 2 HN - (f,Pf)n N ) 

^ j=1 Jy n 

From this and (2.32), (2.31) follows, since / itself is an admissible trial function for /j^. The final 
statement is an evident consequence of the proof of (2.31). ■ 



Corrolary 2.3: With kn and (3n defined as in (2.22) and (2.23), define 

A N = N(l - X N ) . (2.33) 

Then 

A N > (1 - max{ K7V , N })A N -i (2.34) 
for all N > 3, and hence for all N > 2, 

N 

A N >Y[(1- max{K J ,p J })A 2 . (2.35) 

i=3 

Proof: This follows directly from (2.24), (2.31) and (2.33). ■ 

We see that a sufficient condition for liminfAr^oo Ajv > is A 2 > and 

oo 

Y[ (1 - max{KAT, p N }) > . 

N=3 
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Assuming that max{Kjv, Pn} < 1 for all iV > 3, this last condition is of course satisfied whenever 

oo 

maxjKAT, (3n} < oo • 



N=3 



3: Analysis of the Kac Model 

The Kac model, with (Xjy, Sn, ^n) being S' 7V_1 equipped with its rotation invariant probability 
measure and 

Qf(v 1 ,v 2 ,...,v N )=(*) Yl r P(0)f(v 1 ,v 2 ,...,v*(9),...,v*(e),...,v n )d9 , 

\ J {<i J-7T 

was the basic motivating example for the definition of a Kac system made in the previous section, 
where the rest of the elements of the system, namely the action of Ily, the spaces (Yn,Tn,un), 
and the maps iTj and <pj have all been specified. 

All that remains to be done before we apply the results of section 2 is to compute the spectrum 
of K. There are a number of ways that this can be done. The method presented here is the one 
that most readily adapts to the case of three dimensional momentum conserving collisions, which 
we treat in the next section. In a later section we shall use a more group theoretic approach when 
we discuss the generalization of the Kac walk to SO(N). 

Theorem 3.1 There is a complete orthonormal set {g n }, n> 0, of eigenf unctions of K where g n 
is a polynomial of degree n and the corresponding eigenvalue a n is zero if n is odd, and if n = 2k, 
a n is given by 

:N-3 



ON -.i f-TT 

«2fc = (-l)"^^ J o (l-sin 2 9) k S m N - 3 9de 



In particular, 



1 

Q 2 = 



«4 



«6 



«8 



TV- 1 
3 



N 2 -l 

15 



(3.1) 



(7V-l)(iV + l)(7V + 3) 
105 



{N -1)(N + 1)(N + 3)(N + 5) 
and |«2fc+2| < \c*2k\ f or all k. Hence for the Kac model, 

3 

max{K N ,/3 N } = k n = N 2 _i • 
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Proof: We have already deduced an explicit form (2.20) for K in the previous section. We note 
that by an obvious change of variable, we may rewrite it as 

I ^grJV— 3 I l-TT 

Kg{v) = j-^j J Q g(Vl-v 2 co S 6)sm N - 3 ed6 . 

The right hand side is clearly an even function of v. Since the operator K preserves parity, it 
evidently annihilates all odd functions. Hence we may assume that g is even. 

Further, since — t> 2 ) = (1 — v 2 ) k is a polynomial of degree 2k in v, we see that the space of 
polynomials of degree 2n or less is invariant under K for all n. This implies that the eigenvectors 
are even polynomials, and that there is exactly one such eigenvector for each degree 2k. 

Now let gik be the eigenvector that is a polynomial of degree 2k, and let a 2 k be the corresponding 
eigenvalue. We may normalize g 2 k so that the leading coefficient is 1, and we then have 

92k = v 2k + h(v) 

where h(v) is an even polynomial in v of degree no more than 2k — 2. Thus 

a 2 kV 2k + a 2k h(v) = a 2 k92k = Kg 2k = Kv 2k + Kh(v) 

This implies that 

Kv 2k = a 2 kV 2k + lower order . 
The result now follows directly from the formula for K, the recurrence relation 

f sm n {e)de = — - f sm n - 2 (9)d6 , (3.2) 
Jo n Jo 

and the fact that Kl = 1. Observe that the leading coefficient of v in (1 — v 2 ) k is (— l) k . ■ 



It is evident from (3.1) that, using the notation of Theorem 2.1, = 3/(N — 1) and (5 



N 



1/{N — l) 2 . Hence, for N > 3, km > (3n, and Theorems 1.1 and 1.2 are now proved. 

In order to prove Theorem 1.3, it is necessary to determine A 2 . But in (1.25) we have already 
determined A2, and since A 2 = 2(1 — A2), it follows that 



A 2 = 2 inf jy (1 - cos(fc0))p(0)d0 J . 



By the Riemann-Lebesgue lemma, A 2 < 1, and so in any case A 2 > 0. In the case Kac considered, 
S is just the projection onto the constants and A 2 = so that A 2 = 2. 
It remains to solve the recurrence relation (1.21). Notice that 

(jV-2)(iV + 2) 

1 - KjV= (iV_i)(Ar + 1 ) • < 3 - 3 ) 
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The product of these terms collapses and 



N 



n (j-2)(i + 2) = liV + 2 



Hence 

and it then follows from (2.35) of Corollary 2.3 that 

liV + 2 A 1 N + 2 , , 1 - A 2 iV + 2 , N 

Aat > — A 2 = — 2(1-A 2 ) = — — . 3.6) 

Now, we inquire into the sharpness of this result. By Theorems 2.1 and 3.1, 

P/n = ^n/n (3.7) 

if and only if has the form /at = Ylf=i 9n °^ and Kg^ = (3/ (iV 2 — 1))<7jv- That is, (3.7) holds 
exactly when, up to a multiple of 

N 

i=i 

By the last part of Theorem 2.2, the bound obtained in Theorem 1.3 can only be sharp if Q/n = 
^n/n for each N. Hence it is natural to compute Q/n- The result is contained in the next lemma. 



Lemma 3.2 For f N (v) = Z, =1 ~ <1,^», 



where 



7 = ^ - J cos(40)p(0)d0^ . (3.9) 



Proof: This is a straightforward calculation. ■ 



Clearly, for the original Kac model, with p uniform, 7 = 1/4, and so (3.8) implies that A at = 
N(l — Atv) is no larger than (N + 2)/{2{N — 1)). Since for the original Kac model A 2 = 0, this 
upper bound on coincides with the lower bound in (3.6) and hence (3.6) is sharp in this case. 

In fact, the upper bound on A^v provided by Lemma 2.3 coincides with the lower bound in (3.6) 
whenever / 2 (wi,f 2 ) = vf + v\ — (3/4) is such that 

Qh = A2/2 • (3.10) 
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Writing v\ = cos(#) and = sin(0), we have 



/(cos(0),sin(0)) = 



cos(4fl) 
4 



Hence (3.10) certainly holds whenever (1.28) holds. Finally, the fact that under the condition 
(1.28), /at is, up to a multiple, the only eigenfuncton of Q with eigenvalue Xn follows directly from 
Theorems 3.1, which says kn has multiplicity one, and Theorem 2.1. This completes the proof of 
Theorem 1.3. 

We shall show in section 7 of this paper that actually in a wide range of circumstances 



for all N sufficiently large, even if this is false for, say, N = 2. Thus in a great many cases Lemma 
3.2 provides the precise value of Xn, and hence Ajv, for large N. However, before returning to 
analyze the Kac model in this detail, we proceed to give several more examples of Kac systems. 

Having explained how our exact determination of the gap for Kac's original model works it is 
appropriate to compare this approach with Janvresse's [9] application of Yau's martingale method 
[13], [14] to the same problem. There are similarities between our analysis and Yau's method, in 
that Yau's martingale method uses induction on N, correlation estimates, and the same conditional 
expectation operators Pj. There are, however, significant differences, as indicated by the difference 
between Janvresse's estimate and our exact calculation. 

First, in Yau's method the spectrum of the Pj operators is estimated not in 7~Cn,e, but in the 
Hilbert space whose inner product is (h, (I — Q)h), the so-called Dirichlet form space associated to 
Q. This means that the details of the dynamics enter (through Q) at each stage of the induction, 
while in our approach purely geometric estimates, as described in Theorem 1.1, relate An to Ajv-i- 

Second, Yau's method was designed to handle problems without the permutation symmetry 
that is present in the class of models considered here. The method just described makes full use of 
this symmetry. As an example, using this symmetry, we need only to produce spectral estimates 
on P, the average of the Pj. That the inductive argument presented here makes full use of this 
permutation symmetry is one source of its incisiveness in this class of problems. 

4: Analysis of the Boltzmann Collision Model 

Consider now a pair of identical particles with velocities Vi and Vj in IB 3 . Now we will require 
that the collisions conserve momentum as well as energy. These are four constraints on six variables, 
and hence the set of all kinematically possible collisions is two dimensional. It may be identified 
with S 2 as follows: For any unit vector co in S 2 , define 



QIn — Xn/n 



V*(u) =Vi + (u ■ (Vj - Vi))u 
V*(uj) = Vj - (UJ ■ (Vj -Vi))u 

Now specify ./V velocities v = (vi, V2, ■ ■ ■ , vn) before the collision with 



(4.1) 



(4.2) 



N 



N 




and 




(4.3) 
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The random collision mechanism is now that we pick a pair i < j, uniformly at random, and 
then pick an uj in S 2 at random, and the post-collisonal velocities then become 



(Oi,.. .,v*(u), . . .,Vj(u), ...,v N ) ■ 
We then define the one step transition operator Q by 



/ N - lV 1 N f 

Qf(v) = i 2 J ^lf(v 1 ,v 2 ,...,v*(u),...,v*(u),...,v n )b(u;-(v i -v j )/\v j -v j \)duj, 

(4.4) 

where b is a non-negative function on [— 1 , 1] so that 



2tt f 6(cos#)sin(#)d# = 1 . 
Jo 



The function b puts a weight on the choice of uj so as to determine the relative likelihood of 
various scattering angles. This definition differs from corresponding definition for the Kac model 
chiefly through the more complicated formulae (4.1) and (4.2) paramaterizing three dimensional 
momentum conserving collisions. We begin the analysis of this Boltzmann collision model by 
specifying the structure needed to display it as a Kac system. 
By choice of scales and coordinates, we may assume that 

TV N 

J>/ = 1 and 2>i = (4.5) 

3 = 1 3=1 

both hold initially, and hence for all time. Thus our state space is the set of all vectors 

v= ( Vl ,v 2 ,...,v N ) G m 3N 

satisfying the constraints in (4.5). We equip with its Borel field and the metric and uniform 
probability measure inherited from its natural embedding in IR 3N . The symmetric group II jv acts 
on Xn as follows: for a G Iljv, 

<r(v 1 ,V 2 ,...,V N ) = (U <7 (1),U (T (2), • • • ,V a ( N )) . 

This action is clearly measure preserving. We note that Xn is geometrically equivalent to the 
unit sphere 5 3Ar_4 in M 3N ~ 3 , but apart from identifying normalization factors in our probability 
measures, this identification is not conducive to efficient computation because any embedding in 
M 3N ~ 3 obscures the action of the symmetric group. 

To identify the single particle state space Yjv, note that 

TV - 1 

sup{|i;at| I (vi,V2, ■ ■ ■ ,vn) G X n } = — . (4.6) 
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To see this, fix Vn and observe that due the momentum constraint in (4.5), J2f=i v j = ~~ V N- To 
maximize \vn\, we must minimize the energy in the first N — 1 particles. However, by convexity it 
is clear that 



inf < 



JV-l 



^ N 2 J2 V > 



i=i 



N-l 



-v N 



> 



is attained at 

(v 1 ,v 2 ,...,v N - 1 ) = - (v N ,v N ,...,v N ) , 

which leads directly to (4.6). 

In short, the momentum constraint prevents all of the energy from belonging to a single particle, 
and so each vj lies in the ball of radius y/(N — l)/N in IR 3 . (While this is true for N = 2, this 
case is somewhat special. For N = 2, v 2 = —v\ and so |i>2| = 1/V%, rather than \v 2 \ < l/y/2.) 

We could take Y/v to be the ball of radius a/ (N — 1) /N in IR 3 , for N > 3, which would then 
depend on N. However, certain calculations will work out more simply if we rescale and take Ijv 
to be the unit ball in IR 3 , independent of N. Therefore, we define, for N > 3, 

Y N = {v £ IR 3 | \v\ < 1 } 

and let TJv be the corresponding Borel field. We take Y 2 to be the unit sphere in IR 3 . We are then 
led to define ttj : Xn — ► Vjv by 

N ^ 1/2 



1Tj(v 1 ,V2,...,V N )= ^jj—jj V j . (4.7) 

The measure v N is now determined through (2.3), but before deducing an explicit formula for it, 
we introduce the maps <f>j : X^-i x — > Xjv, through which this formula is readily determined. 

Consider any fixed N > 3, so that X^-i is non empty. Fix a point tZ; = (101,102, • • • ,wn-i) € 
Xjv-i, and a point v £ Yjy. In order that we have 

7T N ((p N (w,v)) = V , 



the Nth component of 4>n(vj,v) must be yf (N — 1)/Nv. Now observe that for any a G iR, 



satisfies J2f=i v j = 0> an d 

N 

El |2 2 1 I 1 2 

N =a +|u| , 

j=i 

since [CjL'i 1 I^jl 2 = 1 an d S^Tl 1 u j = 0- Therefore, define 

a 2 ( v ) = 1 - \ v \ 2 (4.8) 
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and 



(p N {(w 1 ,w 2 , w N -!),v) = ya(v) Wl - ^ N l_ • • • , a^WN-i - -^== v, y ^-j^vj , 

(4.9) 

and we have that 4>n '■ -^at-i x Yn X^. For j = 1, . . . , N — 1, let ct^at be the pair permutation 
exchanging j and AT, and define <j)j = ct^at ° (j>N- We now show that with these definitions (2.9) 
holds, and in the process, obtain an explicit formula for v^. 

Lemma 4.1 For N > 3, the measure vn induced on through (2.3) for the Boltzmann collision 
model is 

d^W = p^(l-H 2 ) (3JV - 8)/2 d-. (4.10) 

In the case N = 2, u 2 is the uniform probability measure on S 2 = Y 2 . Moreover, for these measures 
vn, and with 4>j defined as above, (2.9) holds for the Boltzmann collision model for all N > 3. 

Proof: The measure is defined through the natural embedding of in IR 3N , and hence it is 
advantageous to consider the tangent spaces to Xn as subspaces of IR 3N . Making this identification, 
a vector £ = £ 2) • • • > £at) is tangent to Xn at v = {v\, v 2 , ■ ■ ■ , vn) G Xn provided 

N N 
3 = 1 3=1 

Likewise, a vector if = (r/i, i] 2 , ■ ■ ■ , Vn-i) is tangent to ^at_i at w = W2, • • • , u^at-i) G Xn-i 
provided 

JV-1 AT-1 

5>,-^ = and E% = 0. (4.12) 

And finally, it is clear that the tangent space at any point v of Yn is IB? . 
Now let 

(<f> N )* : T,(Xjv_i) x T^Fat) T,(Xjv) 

be the tangent bundle map induced by 0at. One easily computes the derivatives and finds that for 
a tangent vector (ff, 0) at (uJ, v) , 

(M.fo0) = (a(«)7f,0) . (4.13) 
Likewise, for a tangent vector (0, u) at (w,v), 



1 1 \ fZ \ I v " u 1 u " u 1 N —1 \ 

<wv)*(U,u) = — — wi u, . . . , — — — wn-i m, \ u . (4.14) 

K ' K ' \a(v) ^N 2 -N a(v) N ^N 2 -N V N J V ; 

Now let (x be any vector of the type in (4.13), and let (y be any vector of the type in (4.14). 
Obviously 

(Cx,(y) = (4.15) 
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where the inner product is the standard inner product in M 3N . Moreover, 

(Cx,Cx) = a 2 (v){fj,rf) 



(4.16) 



where the inner product on the right is the standard one in m 3N ~ 3 . The determinant of the 
quadratic form qx given by 

ff i ^ a 2 (v)(rf, fj) 



is 



det(q x ) 

since X^-i is 3iV — 7 dimensional. Finally, 



a 



2{ZN-7) 



(v) 



a A (v) 

and the determinant of the quadratic form qy given by 

(v ■ u) 2 



(4.17) 



(4.18) 



u 



a 2 (v) 



+ u 



IS 



det{q Y ) = 1 + 



a 2 (v) a 2 (v) 



(4.19) 



Now let {In and /Iat-i denote the unnormalized measures on Xn and Xn-i given by the Rie- 
mannian structures induced by their natural Euclidean embeddings. If(x\,..., X3N-7) is any set of 
coordinates for X^-i, and if (2/1,3/2,2/3) are the obvious Euclidean coordinates for Fjv, then these 
induce, through 0jv, a system of coordinates on Xm- (Since X^-i is a sphere, up to a set of mea- 
sure zero, one chart of coordinates suffices.) The volume element d/2jv(x,y) in these coordinates 
can now be expressed the volume element d/ijv-i(x) using (4.15) (4.17) and (4.19): 

1 



d(i N (x,y) = a (v)dfi N _ 1 (x)——-dy . 



a(v) 



Since we know that 



Ixj, 



dpL 



N 



->3JV-4i 



we easily deduce from this that for all continuous functions / on X N , 

(1 _ | v |2 )( 3iV-8)/2 du 



f f(v)dfi N = \ S Q3N _ 4 \ [ [ fo (f) N dfi N -i 

JX N P I JY N 



(4.20) 



Finally, suppose that / has the form / = g o ir^ for some continuous function g on Yjy, N > 3. 
Then evidently / o <p N (w,v) = g(v) everywhere on X^-i x Y/v and hence by the definition (2.3) 
and (4.20), 

\S 3N ~ 7 \ 



/ gdu N = / fdfi N - , 
Jy n Jx n P 



g(v)(l - \v\ 2 )^ N - 8 ^ 2 dv 



(4.21) 



Y N 
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Hence we see that (4.10) holds, and hence that (2.9) holds for the Boltzmann collision model. ■ 



Lemma 4.2 The Boltzmann collision model, consisting of (X n ,Sn, Hn), (Yn,T n ,v n ), ttj, 4>j> 
j = 1, . . . , N, and Q as specified in this section constitute a Kac system as defined in section 2. 

Proof: The properties not already established in Lemma 4.1 are now easily checked using (4.9). 



Now in order to apply the results of section 2 to this Kac system, we need to determine the 
spectral properties of the operator K. The explicit form of K for the Boltzmann collision model is 
easily obtained from (2.19): For all functions g on Y^, the unit ball in in IR 3 , and all TV > 3, 



K9{v) = jL x 9 (\^r (v^h 1 ^ - Tpbr)) 




(l-|y| 2 ) (!W - 11)/! Wi(v) (4-22) 



=KL- ( - *M * - ^ 

(The restriction to N > 3 is because (4.21) only gives us the right form for v N is this range. Indeed, 
37V — 11 is negative for N = 3. The correct analogs of (4.21) and (4.22) are easily worked out by 
the same sort of analysis. We do not do this here, as we do not need these formulae.) 

Several properties of K are evident from (4.22). First, K commutes with rotations in IR 3 . That 
is, if R-.m 3 -> m 3 is a rotation, then clearly 

K(g o R) = (Kg) o R . 

Hence we may restrict our search for eigenfunctions g of K to functions of the form 

g(v)=h(\v\)\vfy e , m (v/\v\) 

for some function h on IR + , and some spherical harmonic 3^,m- 

Second, for each n > 0, K preserves the space of polynomials of degree n. To see this notice 
that any monomial in sj\ — \v^w that is of odd degree is annihilated when integrated against 
(1 — \w\ 2 )^ 3N ~ 11 ^ 2 dw, and any even monomial in y/l — \v\ 2 w is a polynomial in v. 

Combining these two observations, we see that K has a complete basis of eigenfunctions of the 
form 

9n,e,m(v) = h n , e (\v\ 2 )\v\ e y e , m (v/\v\) (4.23) 
where h n ^ is a polynomial of degree n. 
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A third observation leads to an explicit identification of these polynomials and a formula for the 
eigenf unctions: Suppose that Kg(v) = Xg(v). Let e be any unit vector in M 3 . Then since g is a 
polynomial and hence continuous, 



lim^(te) = Jim jf^ g [^^VT^ W - ^e) (1 - M^-iD/'d*, 



1 

-e 



TV — 1 

since .FTl = 1. Combining this with Kg(v) = Xg(v), we have 

Now consider any eigenfunction g n ,e,m of the form given in (4.23), and let X n ,e be the corre- 
sponding eigenvalue, which will not depend on m. Then taking any e so that 3^,m(e) / 0, we have 
from (4.24) that 

^^«(-^)'- 

Finally, a fourth elementary observation identifies the polynomials h n ^. For all distinct positive 
integers n and p, the eigenfunctions g n ,i,m and g Pj i jm are orthogonal in /Cat. Hence for each £, and 
for n ^ p, 

/ V<(M 2 )'w(M 2 )(i - M 2 ) (3Ar - 8)/2 M 2£ d, = o . 
7i«i<i 

Taking r = |t>| 2 as a new variable, we have 

f h n , e (r)h p ,e(r)(l - r )^)/\^dr = . 
J o 

This is the orthogonality relation for a family of Jacobi polynomials in one standard form, and this 
identifies the polynomials h n ^. A more common standard form, and one that is used in the sources 
to which we shall refer, is obtained by the change of variable t = 2r — 1, so that the t ranges over 
the interval [—1, 1]. Then for a, (3 > —1, J%*'^\t) is the orthogonal nth degree polynomial for the 
weight (1 — i) Q (l + t) 13 . Then with the variables t and \v\ 2 related as above; i.e., 

t = 2\v\ 2 - 1 , (4.26) 

h n , e (\v\ 2 ) = 4 a >V(t) (4.27) 

for 

3N — 8 1 
a = and = i + - . (4.28) 
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The particular normalization of the Jacobi polynomials is irrelevant here, as we shall be concerned 
with ratios of the form Jn Q '^(i)/Jn Q '^(l). Indeed, notice that from (4.26) when \v\ 2 = 1, t = 1, 
and when |t>| 2 = 1/(N 2 - N), t = -1 + 2/(iV 2 - N). Hence from (4.27) and (4.25), we see that 



A n ,£ — 



jf\-l + 2/{N 2 -N)) 



■J; 



(1) 



N - 1 



(4.29) 



We summarize this in the following lemma: 
Lemma 4.3 Define the functions 

g n ,e,m(v) = h n ,e{\v\ 2 )\vfy e , m {v/\v\) 

n > 0, £ > and — £ < m < £, where the ye : m « re an orthonormal family of spherical harmonics, 
and the h n ^ are polynomials expressible in terms of the Jacobi polynomials through (4-27). Then 

{g n ,e,m I n>0,£>0,-£<m<£} 

is a complete orthonormal basis of eigenfuntions of K . Moreover, if is the corresponding 
eigenvalue, then (4-29) holds. 

The problem of determining the spectral gap for K is thus reduced to the problem of determining 
the largest number of the form (4.29). The following integral representation of ratios of Jacobi 
polynomials, due to Koornwinder [11] (see also [1], pp. 31 ff.), is useful in this regard. 

For all —1 < x < 1, all n and all a > (3, 



J, 



(x) 



7T fl 

o Jo L 



1 + x — (1 — x)r^ 



+ i\J 1 — x 2 r cos(#) 



dm a)/3 (r,0) 



(4.30) 



where 



m a ^(r, 9) = c a ,fs(l - r 2 )«-£- V^ +1 (sin0) 2/3 ^ 



and c a ^ is a normalizing constant that makes dm a ^ a probability measure. 
Notice from (4.28) that a > (3 exactly when 2£ < 3N — 9. Hence we define 



£o = 



3N -9 



(4.31) 



For all £ < £ , we may use (4.30) to compute A n ^. 
First, however, observe that 



1 + x — (1 — x)r 



+ iy 1 — x 2 r cos(#) 



(1 + X) 2 {1 -X) 2 2 1 -X 2 2 

^— -L + K -—^r 2 + — — r 2 cos(2#) < 1 



(4.32) 
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with equality exactly when r = 1, and 9 = or ir. 
Lemma 4.4 For all I < £q(N), and all m, 



\^n,t\ < fJ"n,t 



where, with x = -1 + 2/(N - l) 2 , 



/■* i + x -(i- x y . r 

Hn,t= / / o h«vl - a; 2 rcos(6») 

Jo Jo 1 

Moreover, for each t, n ^ [i n ^ is monotone decreasing: 



n/2 



dm a ^(r,6) 



N - 1 



Hk,e < Hj,t for all k > j . 



Proof: The montonicity follows directly from (4.32), and the rest is a summary of the discussion 
above. 

We now proceed to calculate the eigenvalues for £ + 1/2 < (3iV - 8)/2 using (4.30) and (4.29). 
The case n = is trivial: 

1 



•W — M0,£ — 



N- 1 

for all £ < 4>- The montonicity in Lemma 4.4 now guarantees that for all n and all 3 < £ < £q, 



(4.33) 



l<VI < 



1 



TV — 1 



(4.34) 



Next, it is straightforward to calculate Ai^ and A2/ using (3.2) and the beta integral 



L 



\ 1 -tr-H b - 1 dt= T{a)r{b) 



The results for n = 1 and n = 2 



Ai /> = 



e-(l-e) 



21 + 3 
3N-6 



1 



TV- 1 



and 



where 



A2,£ = 



^_M±22 E(1 _, ) + (1 _ E) ^ + 5 >( 2< + 3 > 



3iV-6 



(3n- 6) (37V -4) J V N-l 



e = 



(N-l) 2 ' 
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(4.35) 

(4.36) 
(4.37) 



The eigenvalues 

21N 



+ 



_(N - 1) 3(iV-l) 2 



N 



are all negative and hence irrelevant for calculating the gap of K. Note that A2,o is asymptotically 
5/3iV 2 and otherwise A2/ is 0(l/N 3 ). In fact for all £ in the specified range with £ > 3 

. 1 1 

\An,e\ < Hn,e < /^0,3 < -^3 • 

Finally, a very simple computation provides a constant C independent of N so that 

C C C 

M3,o < ^3 , M2,i < ^3 , and m , 2 < — . (4.38) 

(The cases with n odd are most easily done through estimates on cases with even n, For example, 
since < ^Jf by Holder's inequality, it suffices to show that = 0(l/iV 4 ).) Therefore, again 
using the montonicity form Lemma 4.4, the only values of (n, £) with I < £ such that \X n ,e\ is of 
order (1/N 2 ) or larger are those for which n + £ < 2. By the computations above, we then have 

sup{ A„, £ I n + £ > 0, £ < £ } = X 2 ,o (4.39) 

and 

inf{ \ n>e \£<£ } = Ai, • (4.40) 

Regarding the restriction £ < ^0 m (4.39) and (4.40), it is reassuring to note that for the largest 
value of £ in this range, the corresponding eigenvalues are no larger than (l/(iV — l))( 3Ar_11 )/ 2 . 
This suggests that a fairly crude bound on the part of the spectrum corresponding to t > £q will 
suffice to eliminate the restriction on £ in (4.39) and (4.40). We shall show that this is the case. 

For this purpose we need the integral kernel corresponding to the operator K. From (4.22) we 
have that for all g € /Czv, 



(9, Kg)tc N 

\g3N-10 



\ S 3N- 



il I 9(v)9(u(y,v))(l - M 2 )r- U)/2 (1 " Iv^r-^ydv , 



where 



( VN* - 2N r—^ 1 

u (y^ v ) = — r T — 1 — v 1 - \ v \ y - — — ~ v 



n -1 v 1 1 a n -1 

Making the change of variables (y,v) — ► (u(y,v),v) we find 



^ 1 ( N ~ 1 2 1 2 2 
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which leads to 



| S 3JV-4| / N 2_2N 



3/2 



|53JV-io| y N 2 -2N + 1 

J\y\<l J\v\<l \ iV _ 1 



(g,Kg) KN = 



2 N-l 2 2 

u v 

2 7V-2 N -2 



u • V 



(3iV-ll)/2 



(4.41) 



We therefore define the kernel K (u, v) by 



K(u, v) 



\S 3N ~ 



10 1 



| 5 3iV- 



iV 2 - 2iV + 1 
N 2 -2N 



3/2 



1 - 



N- 1 2 iV- 1 o 2 

■W — -V — 



N -2 iV-2 7V-2 



(3iV-ll)/2 



Now let the g n ,e,m be the normalized eigenfunctions introduced in Lemma 4.3. Then 



(4.42) 



-gntj 



1 

m= — t 



m=-£ 



if(u, i> )dudu; 



(4.43) 



= y /i„/(|u| 2 )/i n/ (|w| 2 )P^((n • i;)/|ti||t;|).K'(t;,u;)dt;du; 

where, P^(cos(0)) is the Legendre polynomial of order £, and it is orthogonal to all other polynomials 
of degree strictly less than £. 

Now note that the positive part taken in (4.42) is superfluous unless the values of \u\ and \v\ are 
such that both 



and 



N — 1 

'i 

N -2 

N — 1 9 

V 

N -2 



N — 1 

N — 2 



w 2 < 



N -2 



v\ \w\ 



N — 1 

N-2' 1 



-w z > -- 



N-2' 



v w 



(4.44) 
(4.45) 



Therfore for all values |it| and \v\ that are in the complement of the set defined by (4.44) and (4.45), 
the kernel K(u,v) is a polynomial of degree (3N — 11) /2 in cos{9) = u ■ v/(\u\\v\), provided only 
that N is odd so that (37V - ll)/2 is an integer. Now if £ > £ , then then £ > (3N - ll)/2 Hence, 
for such fixed values of \v\ and \ w\, the integral over 6 vanishes, and we may as well redefine K(v, w) 
so that it vanishes on the complement of the set defined by (4.44) and (4.45). 
Hence, going back to (4.43), we have the whenever N is odd and £ > £0, 



1 ^ 

7Tc , ; y2 / 9ne, m (u)g n£:Tn (v)K A (u,v)dudv (4.46) 

21 + 1 r ^. e JM<iJ\v\<i 
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where 



/ \ (3JV-ll)/2 

/ j\f 1||2 N 1 l 1 2 2 \ 

K A (U,V) = ^ ^2 _ 2 jV ) \S 3N ~ 7 \ (1 _ \ v \2)(.3N-8)/4(i _ | n |2)(3JV-8)/4 1 a(u,v) 

and where A is the set of points (u, v) G iR 6 satisfying (4.44) and (4.45), and finally where 

1/2 



/ I 03^-71 \ L / z 



The last definition is such that 



/ \g n t,m{u)\ 2 &u = \\g n e, m \\ 2 K: N = 1 , 

J\u\<l 



>\u\<l 

and so by (4.46) and the Schwarz inequality, whenever N is odd and £ > £q, 

\l A < f f \K A (u,v)\ 2 dudv . (4.47) 

J\u\<l J\v\<l 

This leads directly to the following lemma: 

Lemma 4.5 There is a finite integer N such that for all odd integers N > N , and all i > £q, 

fi\*" 2 
K<i<(^J ■ 

Proof: On account of (4.47) and the definition of K A , our task is to estimate 



L 



/ x 37V-11 

M _ iV^l | |2 _ N=l\ \2 2_ \ 

y 1 AT— 2 1 "I AT-2l y l N-2 U V ) 

■dudv . 



I A (1 _ | u |2)(3Af-8)/2( 1 _ |^|2)(3iV-8)/2 

Define the quadratic forms 



on IR 2 . Notice that 



AT — 1 , l2 AT — 1 , l2 2 \ 

1 — — \U\ —— \V\ —— U ■ V < 1 — (7- ( it , \V ) 

iV-2 1 1 iV-2 1 1 iV-2 / ~ ^ vi hi i; 
and that 

(l-M 2 )(l-|u| 2 ) > (l-(|u| 2 + M 2 )) . 
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Hence the integrand above is no larger than 



<i-.-<M,M>r- u (448) 



(1 - (\ U \ 2 + | V |2))(3iV-8)/2 . 

Next note that the eigenvalues of q± are, in both cases, 1 and N/(N — 2). Hence 

{\u\ 2 + \v\ 2 ) < q±(H\v\) < ^(l^l 2 + M 2 ) • (4-49) 
Thus, by the first of these inequalities, the ratio in (4.48) is no greater than 

(i-,_(RM)) (3iV - 14)/2 • (4.50) 

The conditions (4.44) and (4.45) can be expressed as 

q + (\u\,\v\) >1 and q-(\u\, \v\) < 1 . (4.51) 

Combining the first inequality in (4.51) with the second inequality in (4.49) we have that on A, 
\u\ 2 + \v\ 2 > (N - 2)/N, and hence g-(|u|, \v\) > (N - 2)/N on A. Hence on A, the quantity in 
(4.50) is no greater than (2/iV)( 37V_14 )/ 2 . The result now easily follows. ■ 

Note that on account of this result, increasing iV"o if need be, the condition t < £ in (4.39) and 
(4.40) may be dropped, and the estimates remain valid, for all odd integers N with iV > Nq. Our 
next task concerning the determination of the spectral properties of K is to remove the condition 
that N be odd. 

Lemma 4.6 Let kn and (5n be defined for the Boltzmann collision model as in Theorem 2.1. Then 
for all N > 3, 

\(3n\ < \0n-i\ (4.52) 



and for all N such that kn-i < 1/2, 



KN < . (4.53) 

1 - K N -i 



Proof: First let g satisfy HffUcjv = 1 an d Kg = KNg- Then 



k n = (g, Kg) 

= / g ° ^ig ° vr 2 d^jv 



(4.54) 



/ / {go7r 1 o<p N )(goir 2 o(J) N )dfi N - l 

J Yjv J Xjv - i 



'N 
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Now, 
where 



g O 7Tl O <fi N (uJ, v) = h v O 7Tl (t?) 



My) = 5 — 77 — : — v 1 - My - — — ~ v 



N - 1 



N — 1 



Finally, let h v be given by 



h v o 7Ti = h v o m 



- K 

J Xn-i 



O 7TiO>Ar_i 



= /l„ O TTi - Pjv(y O TTl) 
= fl v O ITi — Kg O 7TAT . 

Going back to (4.54) and using the variational definition of kn-i we have, much as in the proof of 
Theorem 2.2, 



KN<^N-lj j \K 7Ti| 2 d//jV-i 

J Yat J Xn-i 

= K N -i + (1 - K N -l)K 2 N . 

Since kat_i < 1, this last inequality may be written as i^t^iv) > where 

P 2 (a:) =x 2 



(4.55) 



1 Kat_i 

-x + 



1 - KaT-1 1 - KAT-l 

The polynomial ^(x) has the roots x = 1 and x = kjv-i/(1 — «jv-i) < 1 and is negative between 
these two numbers. Since kn < 1 (4.53) follows. 

The proof of (4.52) is similar but simpler. Suppose g satisfy ||<7||jCiv = 1 an d Kg = (3n9 where 

N = m£{(h,Kh) | \\h\\ KN =1} . 
The analysis that lead to (4.55) now yields 

0N>PN-i + (l-PN-i)\\Kg\\l N 
which certainly implies (4.52). ■ 

We are finally ready to prove the analog of the original Kac conjecture for the Boltzmann collision 
model: 

Theorem 4.7 For the Boltzmann collision model 

lim inf A N > . 
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Proof: We choose N large enough so that for all odd integers N > N , 

where A2,o is specified in (4.36) and (4.37). We can do this since A2,o ~ 5/iV 2 , and Lemma 4.6 tells 
us, increasing N if need be, that 

2 

for all N > N . Now by Corollary 2.3, 

OO 

liminfA;v > ]J (l-2/j 2 )A No . 

j=N„ + l 

The infinite product is clearly strictly positive, and so it remains to verify that A^r > for all N, 
and in particular N = Nq. 

This may as well be done by a compactness argument since we are not being specific about Nq. 
For 1 < i < j < N, define 

Ri,jf(v)= / f(vi,v 2 , ■ ■ ■ ,v*(u), . . . ,v*(u),. . . ,v n )b(uj ■ (vi - Vj)/\vj - Vj\)duj , (4.56) 
Js 2 

so that by (4.4), 

/ x -1 n 

This operator is not compact. In the case where u is selected uniformly, one easily sees that for 
any unit vector e G IR 3 , and any odd integer k, fk{v) = (tti(v) • e) k is an eigenfunction of Q, with a 
non-zero eigenvalue independent of k. In the case of the Kac model this was explicitly observed by 
Diaconis and Saloff-Coste, and this may have been clear to Kac when he remarked on the difficulty 
of showing that Ajv > for the original Kac model. 

However, consider Q 2N . Observe from (4.56) and (4.57) that Q 2N is an average over monomials 
of degree 2N in the operators Rij. Each such monomial enters with the same positive weight, and 
each is a contraction on Hn, since clearly each R i; j is a contraction onTL^. 

Now one such monomial is 

^4 = 2R2, 3-^3,4 • • • Rn,i) (Rn,i ■ ■ ■ R4, 3-^3,2-^2,1) 
which is positive. It follows that there is a positive number a so that 

Q 2N = a A + (1 - a)B 
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where B is a self-adjoint contraction on Hn- (B is the average over the remaining monomials.) 
Now it is easy to see that A is compact. Since it entails averages over each of the variables, it has 
a continuous kernel, and hence is Hilbert-Schmidt. Now 

sM(f,Q 2N f)n N \\\f\\n N = 1, (1J)h n = 1 } 

= su P {(f,aA+(l-a)Bf) \\\f\\ HN = 1, (1, f) Hlf = } 

<asup{(/,A/> \\\f\\ nN =1,{1J) Hn =0} + (l-a) . 

Now since vl is compact, sup{(/, Af) Hjv | H/H^jv = ( l J)n N = } is attained. And clearly if / 
satisfies \\f\\n N = 1, ( l J)n N = 0, and (f,Af) Hff = 1, then ||.Rfc,fc+i/||wjv = 1 = ||/|| Wjv and this 
is impossible by our ergodicity assumptions. ■ 

5: Analysis of a Shuffling Model 

When momentum and energy are conserved for one dimensional velocities, the only possibility 
is an exchange of velocities. Thus the Kac walk in this case is simply a walk on the permutations 
of (v\,V2, ■ ■ ■ ,vn), which, at least when all of these velocities are distinct, we may identify with a 
random walk on the permutation group LTjv. The corresponding walk has been throoughly analyzed 
by Diaconis and Shahshahani [6] , but we briefly discuss it in this section to illustrate several features 
of our approach. (In fact, they estimate approach to uniformity in the total variation norm, for 
which they need, and derive, not only the spectral gap, but information on all of the eigenvalues 
and their multiplicities.) 

Let Xm = IIjv, and let 

Y N = {1,2,...,N} . (5.1) 

For a £ U N , define for j = 1, 2 . . . , N, 

n j (a)=a(j) . (5.2) 

Let /i be normalized counting measure on Xn, so that i/jv is normalized counting measure on Yn- 
To define the transition function, fix a number p with < p < 1, which will represent the 
probability of "success" in a coin toss. Consider a deck of iV distinct cards which are to be "shuffled" 
as follows: Pick a pair i < j uniformly at random, and then toss a coin to generate independent 
Bernoulli variables with success probability p. If the result of the coin toss is success, exchange cards 
at the ith and jth positions from the top of the deck, and otherwise do nothing. This procedure is 
then repeated. 

We can identify the state of the deck at each stage with the permutation a which puts it in that 
order starting from a canonical "unshuffled" order. In these terms, the current state a is updated 
by 

where Cjj is the pair permutation exchanging i and j, and fixing all else. If the result is not success, 
the current state a is not altered. The one step transition operator is clearly 

Qf{a) = ( N 2 ) 1 E tP/K*") + C 1 " P)f^ ■ ( 5 - 3 ) 

^ / i<j 
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To display this as a Kac system, define (f>N '■ -Xjv-i x Yn — ► by 

(f> N (a, k) = a k ,NV (5.4) 

where a(j) = a(j) for j < N — 1, and <r(iV) = AT. Note that tin o cj) N (a,k) = 0- fei iv(cr(AO) = 
(Tk,N{N) = k. We then define 4>j = (fi^ ° a j.N- It is clear that these maps are bijections, and since 
/i/v-i ® fjv is uniform counting measure on Xn_i x Y^. (2.9) is trivially true. Thus it is clear that 
this shuffling model is a Kac system. 
Moreover it is easy to see that 

N 

Kg( i ) = Y J K l , J g(j) (5-5) 

3 = 1 

where 

Hence K has the eigenvalues 1, with multiplicity one, and —\/{N — 1) with multiplicity N — 1. 
Hence for this model, with and (3n as in Theorem 2.1, 

-kjv = (AT - = (5.7) 

and thus by Corollary 2.3 



i=3 

Again, this product collapses, and one finds 



N ( 1 \ 



(5.8) 



1 WII 



n{l-77fTO) = {»^T)5- < 5 ' 9 > 

i=3 



(i-i)V \N-1J2 

Clearly Q 2 may be identified with the matrix 



l-p p 
p l-p 



(5.10) 



and hence Q2 has the eigenvalues 1 and 1 — 2p. Hence A2 = 1 — 2p, and A2 = 4p. Combining this 
with (5.8) and (5.9), we have 

N 

A,v > j^jlP • (5-11) 

To see that this result is sharp, we need to display an appropriate eigenf unction. We know 
from Theorem 2.2 that (5.11) can be sharp if and only if there is a function /jv satisfying both 
Q/n = ^n/n and P/jv = A* at/at- Theorem 2.1 then tell us that since (3n > k-n for this problem, we 
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get an eigenfunction of P with P/jv = unJn from eigenfunctions h of K with Kh = —1/(N — l)h 
through 

f N = h o 7Tj - h o iTj for 1 < i < j < N (5-12) 

for some i < j. A tedious but straightforward computation, using Ylf=i = 0, which is equiva- 
lent to Kh = -1/(N - l)h, shows that 

QfN=(l-J^j)fN. 

This implies that \n > 1 — 2p/(N — 1) and hence 

A,v < ^^2^ . (5.13) 

This leads to the following result: 

Theorem 5.1 The binary shuffling model is a Kac system, and 

An = W^T 2p ■ ( 5 - 14 ) 

Moreover, 

N(I-Q)f=(J^- i 2 P )f 

if and only if f has the form specified in (5.12) for some function h on {1,2, . . . ,N} such that 
YljLi = 0- I n particular, Aat is an eigenvalue of Q of multiplicity (N — l) 2 . 

Proof: The equality (5.14) follows from (5.11) and (5.13), and this identifies Aat. We have shown 
above that every function / of the form (5.12) with = satisfies Qf = Ajv/, and by 

Theorem 2.2, the converse holds as well since any such / must also satisfy Pf = /xat/, and this 
occurs only when / has the specified form. Finally, it is easily seen that the N — 1 functions 

/i O 7Ti — h O TT 2 , ho T\ 2 — ho TT 3 , /l O 7T 3 — /l O 7T 4 , ... ,/lO 7TjV_l — ho TT N 

are a basis for the span of the functions of the form specified in (5.12) whenever ||/i||/c / and 
^ v =1 h(j) = 0. Also if h and h are any two orthogonal eigenfunctions of K, h o m — h o ttj is 
orthogonal to h o tt^ — ho tt^ for all i < j and k < i. Since the —1/(N — 1) has multiplicity N — 1 
as an eigenvalue of K, the final statement is now shown. ■ 

Diaconis and Shahshahani actually devote most of their attention to the model in which the 
success probability p depends on N through p = 1 — 1/N. The present methods are easily adapted 
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to handle the case in which p depends on N. Let Q r denote the transition operator defined in (5.3). 
Then clearly for two different success probabilities p and p' , 

This may be used to take into account the effects of the N dependence in p on X N . 
6: The Kac Walk on SO(N) 

Let SO(N) denote the group of orthogonal N x N matrices with unit determinant. In this 
section we consider a generalization of the original Kac model in which the state space is SO(N) 
instead of S N_1 . This generalization was introduced by Diaconis and Saloff-Coste [5], and studied 
by Maslin as well, both in the case of "uniformly selected rotations", i.e., p{9) = l/2n. To explain 
the nature of the underlying process, which these authors call the "Kac walk on SO(N)" , we let 
Ri,j(0) denote the same rotation in M N that was used in (1.7), except now we identify it with the 
corresponding N x N matrix, and we will now consider our iV-dimensional velocity vectors v as 
column vectors of an othogonal matrix. Then multiplying Rij(8) and the "pre-collisional velocity 
vector" v produces the "postcollisional velocity vector", just as in the original Kac model. 

Given a continuous function / on SO(N), define 

Qf(G) = ]T f f(R iJ (9)G)p(9)de (6.1) 

where p{6) satisfies the same conditions imposed on p{6) in the original Kac model. 

The connection with the Kac walk on S*^ -1 becomes quite clear when one writes G in terms of 
its N columns, G = [vi, V2, ■ ■ ■ , vn], since then 

Ri,j{9)G = [Ri,3{9)^,Ri,j{9)^ 2 , Ri, 3 {e)v N ] . 

Each of the Vj is an element of S N_1 , and it is clear from (1.7) that under the Kac walk on SO(N), 
each column of G is a Markov process in its own right, and is in fact a copy of the original Kac walk 
on S N_1 . Therefore, if / depends on G only through the first column of G, which is an element of 
S N_1 , Qf coincides with what we would get by applying the Q operator for the Kac model to / 
considered as a function on S N ~ X . In this sense the Kac walk on SO{N) is a generalization, and 
indeed, an extension, of the Kac walk on S N ~ X . 

This relation between the Kac walks on SO(N) and S^ -1 provides an immediate upper bound 
on the spectral gap for the walk on SO(N): We see, by restricting the class of test functions to 
those that depend on G only through a single column, that the spectral gap for the Kac walk on 
SO(N) cannot be larger than the spectral gap for the Kac walk on S N ~ l . 

In fact, as found by Maslin in the case in which p(6) is uniform, the two gaps actually coincide. 
In this section, we prove this also when p is not assumed to be uniform. The Kac walk on SO(N) 
provides a good illustration of the methods of this paper in which the "single particle space" depends 
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on N. This example goes beyond our previous examples in other ways as well, as we shall see as 
soon as we begin with displaying it as a Kac system. 

It turns out that it is most convenient to do this through consideration of the Kac walk on 
0(N), the group of orthogonal N x N matrices. For a continuous function / on O(N), we define 
Qf exactly as above, except that now G now ranges over O(N). We equip O(N) with its Borel 
field and its normalized Haar measure /j,n- Then by our assumptions on p, Q extends to be a self 
adjoint Markovian contraction on Hn = L 2 (0(N), /j,n)- However, it is not ergodic; the nullspace 
of Q is two dimensional, and spanned by 1 and det. Of course, on the subspace L 2 (SO(N), fiff), it 
is ergodic. 

The reason for working in the non-ergodic L 2 (0(N), hn) setting is that permutation symmetry 
plays an important role in our analysis. The natural action of II jv on 0(N) is through interchange 
of rows. Note that this extends the action of Hn on S N ~ 1 , considered as the first column of G, that 
we used in our analysis of the original Kac model. Interchange of two rows of an element of SO(N) 
of course changes the sign of the determinant, and so does not preserve SO(N). An alternative 
is to conjugate elements of SO(N) by the permutations; that is, to swap both rows and columns. 
This however complicates the construction of a Kac system for the model, and in any case, it is 
of some interest to show that the methods used here can be applied when there is more than one 
ergodic component. This said, we proceed with the construction of the Kac system. 

The "TV-particle space" (Xn, Sn, A*jv) will of course be O(N) equipped with its Borel field and 
its normalized Haar measure is /j,n, as indicated above. Let 0(N) + denote the component of O(N) 
on which the determinant is positive, so that 0(N) + is just SO(N), and let Let 0(iV)_ denote the 
component of 0(N) on which the determinant is negative. 

For any permutation a in n^v, let P a denote the corresponding N x N permutation matrix. For 
G in O(N), define a(G) by 

a(G) = P a G . 

That is, cr acts on the matrix G by permuting its rows. Clearly this is a measure preserving action 
of n N on 0{N). 

We take the single particle space (Yn, TJv 5 vn) to be S N_1 equipped with its Borel field and 
normalized rotation invariant measure vn- For each j = 1,2,... ,N, let ej be the jth standard 
basis vector in R N , written as a row vector, so that for any N x N matrix A, ejA is the jth row 
of A. We then define 

7r j (G) = e j G. 

That is, TTj(G) is the jth row of G. It is clear that ttj : Xn — ► Yn and that 

7T J (a(G))=7T aU) (G) 

for each j and G. So far, we have verified the first two features required of a Kac system. 

The next steps in the construction of this Kac system are slightly more involved. We have to 
construct the maps <pj : Xn-i xYn — ► Xn, but since there is no canonical embedding of 0(N — 1) 
into O(N), they have to be constructed "by hand", using a convenient coordinate chart. (Just as 
with the original Kac model, the maps 4>j cannot be continuous since Xn-i x Yn and Xn just do 
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not have the same topology. But just as in that case, we are only concerned with measure theoretic 
properties of these mappings, and on a set of full measure they will be well behaved.) 
Let 

v- = (0,0,. ..,0,-1) 

denote the "south pole" in S N_1 . We may use the stereographic projection to identify S N ~ 1 \{v-} 
with ZR^" 1 . At each point of IR^" 1 we of course have the standard orthonormal basis. The 
stereographic projection, which is conformal, caries this back to an orthogonal basis for the tangent 
space at the corresponding point in S N ~ 1 \{v-}. Normalizing these vectors, we thus obtain a 
smoothly varying frame of orthonormal vectors 

{u 1 (v),u 2 (v), ■ ..u N -i(v)} 

in IR N parameterized by v in S' Ar_1 \{?7_}. For each v, they form an orthonormal basis for the 
tangent space to S 1 ^ -1 at v. 

Now for each v in S' Ar_1 \{i;_}, define U(v) to be the (N —1) x N matrix whose jth row is Uj(v). 
Let the jth row of U(v-) be ej. The map v <— ► U(v) is now defined on all of S N_1 , though of course 
it is discontinuous at the "south pole", V-. This, however, will not be a problem. 

Now define the map 

<f> N : 0(N - 1) x S N ~ 1 -► O(N) 

as follows: Given G in 0(N — 1) and v in S N_1 , first form the (N — 1) x N matrix GU(v) with 
U(v) as specified above. Because G is in 0(N — 1) and the rows of U(v) are orthonormal, the rows 
of GU (v) are orthonormal. Moreover, since the rows of U(v) are a basis for the tangent space to 
S N_1 at v, each one is orthogonal to v. The rows of GU(v) are linear combinations of the rows of 
U(v), and hence these too are orthogonal to v. Therefore, if we form the N x N matrix 

" GU(v) ' 
v 



by adjoining v to GU(v) as the final row, we obtain an orthogonal matrix. 

Next observe that the determinant of this matrix is just the determinant of G. Indeed, it is clear 
that when v is the "north pole" , so that U(v) consists of the first N — 1 rows of the N x N identity 
matrix, then the determinant of this matrix is simply the determinant of G, whcih is either +1 or 
— 1. Now as v varies in S' Ar_1 \{?7_}, 



varies continuously. Hence the value is just det(G) for all such v. Continuity fails at the "south 
pole", but there it is again obvious by the special form of the definition of 4>n at v- that still in 
this case the determinant is still just that of G. Hence the image of 



(G,v) 



GU{v) 
v 



>N 



(G,v) 
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does indeed lie in O(N), and det(</>jv((5, #)) = det(G). It is also clear by construction that 

^n(4>n(G,v)) = v 

everywhere on 0(N — 1) x S N ~ 1 . 

Finally, it remains to check that this map is well behaved with respect to the measures fiN-i, 
vn and hn- Given a function / on O(N), we may compute the average of / with respect to [In in 
two stages as follows: First compute the conditional expectation g(v) where 

g(v) = E{f(G)\7T N (G)=v} . 

Then 



I f(G)dfi N (G) = [ g{v)dv N {i 

JO(N) JS 1 *- 1 



'O(N) JS N 

Also, it is clear that we can compute the conditional expectation E{f(G) | ttn(G) = v} by averaging 
over orbits generated by left multiplication of G by elements G' belonging to the subgroup of O(N) 
consisting of orthonormal matrices whose final row is ejv- This subgroup is just a copy of 0(N — 1), 
and so 

g(ir N (G)) = [ f(G'G)d m _ 1 (G') . 

JO(N-l) 

It follows directly from this that 



/ f(G)d m (G)= [ [ f(<p N (G',v))dVN-i(G') 

JO(N) Js^- 1 JO(N-l) 



du N (v) 



We define (f)j for j = 1, 2, . . . , N — 1 in terms of (j)^ and the pair permutations exchanging j and 
N, in the natural way. Clearly the analogs of the results just derived for cf>N hold for each <pj as 
well. Thus, with these definitions, the third feature required of a Kac system is verified. 

To complete the verification that the O(N) Kac walk can be made into a Kac system, we only 
need to verify that 

N 



(f, Qf)n N = jj^2 j Y ((fj,v, Qfj,v)n N ^) dv N (i 



j=l JY N 

where for each j and each y G = S' Ar_1 , 

This is clear, given the definition and computations just above. 

We are not yet ready to apply Theorems 2.1 and 2.2 to this Kac system, since we must modify 
the definition of Xn ■ We define 

X N = sup{(/, Qf) HN | \\f\\ HN = 1 , (/, 1) Hn = , (/, det)n N = } , 

which differs from (2.30) due to the restriction that / be orthogonal to the determinant. 
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As a consequence, a modification of the operators Pj is also required. In the definition (2.14) 
used in examples with a single ergodic component, we averaged over all of Xjv-i, or put differently, 
conditioned only on Yjy. Now we will condition on Yn and the ergodic component. Given / in 7~Cn, 
and j = 1, 2, . . . , N, define g + and g_ on Y^ = S N ~ l as follows: 



'O(JV-l) 

and 



9+(v) = 2 [ / J> -(G)d / x JV -i(G) 

JOfJV-lU 



2 / / ]l? (G)d^-i(G) 

JO(N-l)- 



9-{v 

'O(N-l) 

The definitions are such that 

9±{v) = E{/ | 7T j = v and det = ±1 } . 

The factors of 2 are because 0(N — 1)± each accounts for exactly half of O(N) by volume. We 
now define 

g+MG)) ifdet(G) = +l 
g-(TTj(G)) if det(G) = -1 . 



PjfiG) 



Since Pjf depends on G only through v = iTj(G), it is again convenient to abuse notation by writing 
Pjf(v). 

The point of the definitions is the following: Note that / in Hn satisfies both (/, V)n N = an d 
(/, det) -h n = in case it satisfies both 

f f(G)dfi N (G)=0 and f f(G)dfi N (G)=0. 

JO(N) + JO(N)_ 

But in this case, by the definition of Pj, 



IO(N-l)+ 

and 

/ 

)(JV-1)_ 



/ (f^(G) - P 3 f{v)) d^-iCG) = 



/ (fj,v{G) - Pjfiv)) d//jv-i(G) = , 



for almost every v. Hence (jj,${G) — Pjf(v)j is, for almost every v, orthogonal to both 1 and det 

on 0(N — 1). This is the key requirement for the proof of Theorem 2.2 to hold with the modified 
definition of Aat. We leave to the reader the easy verification of this. Of course the definitions of the 
quantities fi^ in (2.21), and kn in (2.22) have to be modified in the same way as was the definition 
of Ajv- With these modifications made, the analog of Theorem 2.1 holds as well, and again, as the 
proof is essentially the same, the verification of this is left to the reader. We summarize this by 
saying that the presence of more than one ergodic component can be taken into account within the 
framework of ideas described in section 2 by conditioning not just on the single particle space, but 
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on the ergodic components as well. The present model is a case in point, which we choose to leave 
as an example rather than attempting a general formulation. 

With these results in hand, our task is to compute the spectrum of the K operator for this 
system, which is still defined through (2.16). This is what is used in the proof of Theorem 2.1, 
though (2.19) no longer holds due to our modification of the definition of Pj. 

However, it is clear from the definition of K in terms of conditional expectations, through (2.17), 
that 

Kg(v) = [ g{W)dv N _ 2 {W) , (6.2) 

JW-v=0 

where the integral on the right is computed with respect to the uniform probability measure on 
S N ~ 2 , identified with the subset of S 1 ^ -1 consisting of those unit vectors W for which W • v = 0, as 
indicated in the limits of integration. That is, the value of Kg(v) is just the "equatorial average" 
of the values of g around the "equator" with respect to a pole running along v. 

It is immediately clear that K preserves the space of polynomials of any fixed degree d, and 
hence the eigenfunctions of K are the spherical harmonics on S N ~ X . 

The zonal spherical harmonics of degree d are those that depend on W only through e • W for 
some fixed unit vector e. Let Zd te {W) denote the corresponding zonal spherical harmonic, and let 
Pd(x) be the polynomial of degree d so that the zonal spherical harmonic Zd, e (W) on S^ -1 is given 
by 

z d , e (W)=Pd(e-W) . 
The normalization is fixed so that the reproducing kernel property holds: 

f h(W)z dte (W)du N - 1 (W) = h(e) 
Js N - 1 

for any spherical harmonic h of degree d on S 1 ^ -1 . This means that 

\\zd,e\\l =Pd(e-e) =Pd{l) ■ 
Now fix v, and let h(W) be any spherical harmonic of degree d that is orthogonal to Zd,%- Then 

f h{GW)&n N _ x {G) = (6.3) 

JO(N-l) 

where G runs over those rotations of IR N that fix v. This is because the left hand side is a spherical 
harmonic of degree d that depends on W only through v ■ W. This means that it is a multiple of 
Zd,v- However, since h was orthogonal to Zd,$, so is the average, and hence the identity (6.3) is 
established. But comparing (6.2) and (6.3), we see that 

Kg{v) = [ g(GW )dfi N . 1 (G) 

JO(N-l) 

for any W with W ■ v = 0. Hence, under our assumptions on h, Kh(v) = 0. 
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Now let g be any spherical harmonic of degree d. Let P$g be defined by 

Pv9{W) = -4- ( I giW^Zd^W'^UN^iW'))] z d j(W) (6.4) 
Pd{i-) KJs"- 1 J 

which is simply the orthogonal projection of g onto the span of z^a- Evidently g — P^g is a spherical 
harmonic of degree d that is orthogonal to Zd,v, and hence by the above, K(g — Pvg)(v) = 0. It 
follows that 

Kg(v) =K{Pvg){v) . 

But the right hand side is easy to compute as clearly Kzd t v = Pd(fy- Now by (6.4) and the 
reproducing kernel property, we have 

„ Pd(0) _ 

Kg{v) = MV) 9{V) ■ 

Now it is possible to compute the ratios pd(0)/pd(l) using generation functions, though it would 
not be so clear from this which value of d gives the largest ratio. However, none of this is necessary: 
If we fix any direction unit vector e, and take any function <j> on [—1,1], we have 

K(</>(v-g)) = (K Kac <j>) (v-e) 

where K^ ac is the K operator for the original Kac model, whose spectrum we have computed in 
Theorem 3.1. Hence the eigenvalues of K and i^Kac coincide and are provided by Theorem 3.1 
This solves the eigenvalue problem for K. 

Theorem 6.1 Every spherical harmonic 4> on S N_1 of degree d, considered as a function <f> on 
O(N) through 4>(G) = <p(Gw) for any fixed vector w in S' Ar_1 ; is an eigenfunction of K, and the 
corresponding eigenvalue is p d (0) / 'Pd(l) • These eigenvalues are the exactly the eigenvalues of the 
K operator for the original Kac model that are given in Theorem 3. 1 

Therefore, (3.3) and (3.4) hold just as in Section 3, and we have once more that 

liV + 2 A 

It remains to calculate A 2 . But 0(2) is just two copies of S 1 , and the same Fourier analysis 
argument described around (1.24) once more gives us 



A 2 = supj / p(9) cos(k9)de\ 

fe/0 U-7T J 



for the second largest eigenvalue of Q when N = 2. 

The generalization of Maslin's result to nonuniform p now follows immediately from what has 
been said above. 

Altogether, we have proved: 
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Theorem 6.1 The spectral gap for the Kac walk on SO(N) with the transition operator Q given by 
(6.1) coincides exactly, for each N > 2, with the spectral gap for the transition operator Qxac of the 
corresponding Kac walk on S N ~ 1 , as given in (1.7) with the same density p. When p is uniform, 
the multiplicity of the corresponding eigenvalue equals the dimension of the space of fourth degree 
spherical harmonics on S*^ -1 . 



7: Analysis of Maximizers for Nonuniform p(8) 

We return to the Kac model on S' Ar_1 with a non uniform density p(0), so that Q is given by 
(1.7), and we examine the circumstances under which the quartic function f^ given in (1.19) is an 
optimizer for (1.9). Because of the very close relation of the Kac walk on SO(N) to this model, as 
described in Section 6, our analysis is readily adapted to that model as well, though we shall not 
carry out the adaptation here. 

According to Lemma 3.2 /jv is an eigenfunction of Q with the eigenvalue 

l-^<f + 2 ' (7.1) 



where 



We therefore define Tn by 



N(N - 1 

7 = \ J {I - co S (A0))p{6)&6 . (7.2) 



_ 2 7 (iV + 2) _ / (f N ,Qf N ) \ 

Tn - -jN=Tr - N y 1 - -jf^- ) > (7 - 3) 

so that if /at happens to be a maximizer for (1.9), then Tjv = A^r, but otherwise < T^. That 
is, 

A N <T N , (7.4) 

and there is equality in (7.4) if and only if /jv is a maximizer for (1.9). 

Now the operator Q commutes with permutations so the permutation invariant functions / 
constitute an invariant subspace S of Hn,e, and clearly this subspace includes /at. It is the 
symmetric densities / that are relevant in Kac's limit theorem relating the Master equation (1.8) 
and the Kac equation (1.16). We will therefore restrict our attention to this subspace, where the 
result is especially striking, and which is physically the most significant. We therefore define Ajv 
to be the second largest eigenvalue of Q restricted to S: 



X N = sup<! -j-(f,Qf) 



Y f€S, ||/|| 2 = 1,(/,1)=0 \ , (7.5) 

and we define A^v = N(l — \n)- Clearly 

A N <T N , (7.6) 
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and there is equality in (7.6) if and only if is a maximizer for (7.5). 

Taking the symmetry constraints into account it is easy to compute A 2 using (1.24), with the 
result that 

A 2 = 2min f (1 - cos(k0)) p{9)d9 . (7.7) 

fc >! J 

We see from (7.1) and (7.2) that if the supremum in (7.7) occurs at k = 4, then 

A 2 = T 2 . (7.8) 

It is easy to see that A N and Atv-i are still related by the inequality proved in Theorem 1.1 for 
An and Ajv-i: 

A N > (1-k n )A N -i . (7.9) 

We also know from Theorem 1.3 and the definition of Tat in terms of /n that Tn solves this same 
recursion relation: 

T N = (1-K N )T N _ 1 . (7.10) 

Notice that (7.8), (7.9) and (7.10) together imply that A at > T N for all N > 2. But from (7.6) 
we have An < Tn for all N > 2. Hence An = Tjy and /n is a maximizer. 

We see that if / 2 is the maximizer for N = 2, the Jn is a maximizer for all N. So far we are 
simply translating old results into the symmetric case, but we have relied less on explicit calculation 
in order to bring out the following point: Suppose we had any sequence of admissible functions gx 
for the variational problem (7.5), and we defined IV by 

T N = N(l-(g N ,Qg N )) . (7.11) 

Then if it happened that the Tn so defined satsified the recurrence relation (7.10), and also satsified 
(7.8), it would follow by simple comparison that An = Tjv for all N > 2 and gN would be a 
maximizer for (7.5). All that was required of gN is that (7.11) leads to a solution of (7.10), and 
that (7.8) holds. 

The following simple observation leads to further progress: Suppose that the minimum in (7.7) 
does not occur at k = 4, and so (7.8) is false. But suppose that for some iVo, Atv =Tn - That is, 
suppose that for N = N , f'N is a maximizer for (7.5). Then /jv is a maximizer for (7.5) for all 
N > N , so that An = Tn for all N > N and therefore 

lim A N = lim = 27 , (7.12) 

N— >oo N— >oo 

which is the result we would have gotten if the maximizer for N = 2 had been quartic. Therefore, 
either (7.12) holds, or else /jv is never a maximizer for (7.5) for any N 

Now we know that spans the second eigenspace of P corresponding to its second largest 
eigenvalue, where P is the operator whose second largest eigenvalue fiN is the key to the recursion 
in Theorem 2.2. (Recall that P does not depend on p). If for each N , the true maximizer /i^r for 
(7.5) is orthogonal to /at, which is the case whenever is never a maximizer for (7.5), then we 



46 



can replace [In in Theorem 2.2 by a smaller number (In, and hence can replace kn in (7.9) by a 
smaller number kn ■ As we shall see, it turns out that this strictly smaller number kn is 

105 

~ KN = as{N) = (N + 5)(N + 3)(N + 1)(N-1) ' (7J3) 
where a(8) is an eigenvalue of the K operator as described in Theorem 3.1, while 

k n = a(4) = 



(N + l)(N-l) ' 

In summary, we have two things working for us: 

Either (7.9) holds with a kn replaced by a strictly smaller number for all N , or else there is an 
N so that A N = T N for all N > N . 

Now when k n is replaced by Rn in (7.9), it leads to a much better lower bound for A N . But this 
improved lower bound cannot violate (7.6). If it does, it can only mean that the second alternative 
holds and not the first. 

This argument leads to the following result: 

Theorem 7.1: (Conditions for /jv to Maximize for Large N) Assume that 

A 2 > 0.45r 2 . (7.14) 



then for all N sufficiently large, 

A N = T N (7.15) 

and /at is the corresponding eigenfunction. 

Proof: All of the key ideas have been explained above, and it only remains to check the details. 
As we have seen, if (7.15) does not hold for all sufficiently large N, then /jv is orthogonal to the 
true gap eigenfunction h^ for all N, since Q is self adjoint and /at is always an eigenfunction. 

This means that is orthogonal to the constant function and to the function f^- Now we 
repeat the induction argument in the proof of Theorem 2.2 once more but for the constraint that 
/iat is orthogonal to both 1 and /at. Under these new conditions, we obtain the recursion 

Aat < ( Aat_i + (1 — Aat_i)/2jv ) 



in place of (2.31), where £ln is given by (2.21), except that now we require / to be orthogonal to 
both 1 and /jv- Again, Theorem 2.1 shows that jx^ can be computed in terms of the eigenvalues of 
K, with the result that it is a(8) that is now relevant, not a(4), due to the new constraint. (Recall 
that ae(N) is negative, and so is irrelevant.) Hence 

A N > A jV _ 1 (l-a 8 (AT)) , (7.16) 
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The inequality (7.16) can be solved recursively to yield 



A N > 90 nN-mN + 7)T(N + S + i^T(N + 3-iVE)) ~ 
r(AT)r(5 + i v / 6)r(5 - iy/E)T(N + 6)T(N + 4)r(iV + 2) 



With the help of the relation 



T(z)T(l -z) = 



7T 



the limit as N — > oo of (7.17) can be computed and yields 

,. • n x ^ 3 sinh(\/67r) r _ ~ 

hmmf A N > )= -A 2 =: LA 2 . 

n^oo N ~ 770 V6vr 

This quantity has to be compared to 27 and this shows that whenever 27 < LA2 (which is easily 
shown to be implied by (7.14)) there exists some finite iV beyond which /jv is the gap eigenfunction. 
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